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Abstract 

In this paper, the relative efficiency of two propositional systems is 
studied: resolution and cut-free LK in DAG. We give an upper bound for 
translation of resolution refutation to cut-free LK proofs. The best upper 
bound known was 2" and we improve it to n 2+31 ° s ". 

1 Introduction 

The interest in the lengths of proofs of resolution and cut-free LK is threefold. 
In [3] , Cook and Reckhow showed that the problem of whether there exists a 
propositonal proof system in which every tautology has a polynomial size proof 
is equivalent to the problem of NP=?co-NP. In the same paper, they intro- 
duced a notion of polynomial simulation (p-simulation) to compare efficiencies 
of propositional systems so that we can order them in terms of complexity. In 
the early researches, propositional systems were separated into the equivalence 
classes of p-simulation, and the hierarchy of propositional calculi is obtained. 
Haken showed a lower bound for resolution by proving that resolution does 
not polynomially prove the pigeonhole principle [4]. Then, his technique was 
applied to cut-free LK expressed as directed acyclic graphs (DAG) to prove 
its lower bound [9]. In recent researches, it was revealed that the hierarchy 
of propositional systems corresponds to that of circuit classes. Thus, separat- 
ing two propositional systems is equivalent to or strongly related to separating 
corresponding circuit classes. Most of questions of relative strength have been 
settled for systems weaker than LK, with the exception of the relative efficiency 
of resolution and cut-free LK [10]. 

The second source of interest is classical proof theory. Most of the impor- 
tant theorems in classical propositional calculus, such as interpolation theorem 
and Beth's theorem, are obtained as consequences of Gentzen's cut elimination 
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theorem. Unfortunately, cut-elimination for propositional calculus requires the 
function 2" [6]. Even worse, reducing cuts of degree k to k — 1 requires 2" 
[1]. Consequently, theorems mentioned above remain open when we restrict our 
interest in polynomial-time algorithm [5]. For example, whether there exists a 
polynomial-time algorithm to find an interpolant for a given end-sequent is an 
open problem. 

Resolution is a system which is almost equivalent to cut-free LK plus cuts of 
atomic formulas. Finding a polynomial-time or quasi-polynomial-time algorithm 
to eliminate atomic cuts may help us to answer these open problems. 

The third source of interest is theory of automated theorem proving. Cut- 
free LK and resolution are among the most frequently mentioned propositional 
calculi in the field of automated reasoning. Many proof-search programs are 
benefited by the simplicity of cut-free LK and resolution: In cut-free LK, the 
proof of a given tautology is completely determined by the structure of the 
tautology itself by virtue of its subformula property, and resolution has only 
one inference and no axiom. Thus, the question whether there exists a feasible 
function which translates resolution proofs to cut-free LK proofs is counted as 
one of the major open problems in the theory of automated theorem proving. 
In [3] and [1], it was shown that any translation of tree resolution to cut-free 
tree LK requires 2" . The best known translation of resolution to cut-free DAG 
LK also requires 2" and it was widely believed to be its lower bound. 

In this paper, we show that 

n 2+31ogn ig 

to translate resolution into 

cut-free DAG LK. 

2 Resolution and GCNF 

A proof system is defined to be a polynomial time computable function / from 
{0, 1}* onto the set of tautologies. When a proof P of a proof system is given, 
we measure its size by the total number of symbols appearing in P, and its 
length by the total number of lines (inferences) of P. The size of P is denoted 
by size(P) and the length by len(P). 

Let Si and S2 be proof systems for propositional calculus. Si simulates S2 
if and only if there exists a polynomial function p such that for any formula A 
and any proof P2 of A in S2, there exists a Si-proof Pi of A (translated into Si 
language) so that 

size(Pi) < p{size{P2)) 

In other words, a system Si simulates S2 if Si is not less efficient than S2 as a 
proof system. In particular, we say that Si polynomially simulates (p-simulates) 
S2 if there is a polynomial-time algorithm which, given an SVproof of a formula 
A, produces an Si -proof of A. 

We begin with reviewing two propositional systems, resolution and GCNF 
[8] [2]. Both resolution and GCNF are refuting systems: we show that a given 
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formula is a tautology by showing its negation, put into conjunctive normal 
form, is unsatisfiable. 

A literal is a propositional variable or its conjugate. A clause is a finite set 
of literals which is often expressed as a product of literals. The meaning of a 
clause is the disjunction of the literals in the clause. For example l\- ■ - l n means 
l\ V • • • V /„ . (A clause consisting of a single literal is also called a literal if there 
is no danger of confusion.) Let H be a finite set of clauses. A resolution of H 
is a directed acyclic graph such that any leaf of the graph is labeled by a clause 
in H , the root by an empty clause, and any inner node associated with its two 
upper nodes by the resolution rule 

Ci U {x} C 2 U {x} 
resolution rule C\ U C'2 

where neither C\ nor C'2 contains the literals x or x. 

A cedent is a sequence of clauses which is often expressed by a capital Greek 
letter. A sequent is a string of the form r — > A, where T is called the antecedent 
and A the succedent. LK is a Gentzen style sequent calculus [7]. We restrict 
LK to obtain a propositional calculus, GCNF, on clauses rather than ordinary 
formulas. GCNF is equipped with an axiom scheme, 

axioms p,p — » 

and the following two inference rules. 

r -> 

structural inference T* — » 

where T* D T as a set. If C is a clause in T, then the successor of C is the first 
occurrence of C in T* . 

h , Ti — 7- • • • l„ , r n — > 
logical inference l\ ■ ■ ■ l n , Ti , . . . , T n — > . 

where li (i = are literals. In the logical inference, we infer a single 

sequent l\ ■ ■ -l n , Ti, T n from n (> 2) sequents, lj,Tj — > (1 < j < n). 
The indicated occurrence of the clause l\ ■ ■ - l n in the lower sequent is called the 
principal clause, and the indicated occurrences of the clauses l\ , . . . , l n in the 
upper sequents are called the auxiliary clauses of this inference. The principal 
clause is the successor of auxiliary clauses. The successors of clauses in Tj's are 
defined as those in the structural inference. 

Remark: A logical inference in GCNF is an abbreviation of several [V: right] 
inferences in LK. 

We introduce another inference rule to GCNF, called the atomic cut. 

p,T p, r 
atomic cut T 
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where p is a propositional variable. We call p the cut-formula of this inference. 
The occurrence of p (resp. p) appearing in the left (resp. right) upper sequent 
is called the left (resp. right) cut-formula. Cut-formulas have no successor. 

Let P be a GCNF or GCNF + Atomic cut proof, and C, D occurrences of 
clauses in P. C is an ancestor of D either if D is a successor of C or if the 
successor of C is an ancestor of D. A sequent S is a left-ancestor (resp. right- 
ancestor of another sequent T satisfies one of the conditions; (1) S is the left 
(resp. right) upper sequent of T, (2) S is the only upper sequent of T or (3) the 
lower sequent of S is a left-ancestor (resp. right-ancestor) of T. S is an ancestor 
of T if S is either a left- or a right-ancestor of T. Succedents in a GCNF or 
GCNF + Atomic cut proof are always empty. Hence, we omit — »'s for the sake 
of simplicity. When a propositional variable p occurs in a sequent r — > as a 
clause, we say p occurs positively in T — K When a conjugate of a propositional 
variable p occurs in T — > as a clause, we say p occurs negatively. 

Resolution can be also defined as a subsystem of GCNF + Atomic cut. 
Definition 1 A GCNF + Atomic cut proof P is resolution-like if and only if 
P satisfies the following condition. 

1. No logical inference appears below any cut. 

2. Upper sequents of logical inferences are initial sequents. 

3. No atomic formula appears more than once in any sequent. 

4. The cut-formula does not appear in the lower sequent of a cut. 

5. Non-initial sequents do not contain both a propositional variable and its 
negation. 

6. Formulas in the end-sequent does not appear as a cut-formula of any cut. 

Let s be an occurrence of a sequent. We say that s occurs in the end-part of P 
if there exists no logical inference below s. 

Lemma 1 Let H = {/, C\ Ul, . . .C n Ul, D\ Ul, . . . , D m Ul, T} be a set of clauses 
such that the occurrence of I and I are fully indicated, and none of Di (I < i < 
m) is empty. Let H* be a set of clauses, {D\, . . . , D m , T}. For any resolution, 
P, of H , there exists a resolution Q of H* such that size(Q) < size(P) and 
len(Q) < len(P). 

(Proof.) First, change every node C\ Ul for / in Q. Secondly, omit every occur- 
rence of / and I in Q. After a minor modification, we obtain a resolution proof 
of H* without increasing its size and length. □ 

By lemma 1, we can assume that no clause in the hypothesis consists of a 
single literal. 
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Theorem 1 Let P be a resolution of C\, . . . , C n . Then, there exists a resolution- 
like proof P* of C\, . . . , C n —?> such that size(P*) = 0(size(P)) and len(P*) = 
0(len(P). 

(Proof.) First, change every node labeled by C'i (1 < i : < n) in P for the 
subproof 

]i ]i ... p ]i 

l i> n l k' l k 

C- P P 

u l ; *1 ; • • • ; l k 

P P C- 

l \ 1 ■ ■ ■ ; l k ' 

where l\ ■ ■ -l l k = d . 

When nodes {x} U D\ and {x} U D2 in P are replaced by the sequents x, Y\ 
and «,r2 respectively, change the resolution of the form 

{x} U Di {x} U D 2 
£>i UD 2 

for _ 

ff, Ti a;, T 2 

Ti u r 2 Ti u r 2 
ri ur 2 . 

After minor modification (inserting some structural inferences), we obtain a 
GCNF + Atomic cut proof P* . We show that P* satisfies the conditions (1)- 
(7). (1), (2) and (3) are obvious from the construction. (4)-(6) follows from the 
nature of resolution. (7) follows from the remark given above. □ 

3 Cut-elimination 

In this section, we give a non-deterministic algorithm to eliminate all the atomic 
cuts from a given resolution-like proof to obtain a DAG GCNF proof of which 
size is bounded by quasi-polynomial of the size of the original proof. Note that 
the number of nodes in a DAG is equal to the number of different nodes in a 
tree. For simplicity, we express GCNF proofs as trees and measure their com- 
plexity by the number of different sequents contained in them. 
Definition 2 Let J 3 be a tree GCNF proof. We define its DAG-length, de- 
noted by dlen(P), by the number of the different lines in P. Similarly, we define 
its DAG-size, dsize(P), by the total number of symbols contained in different 
lines in P. 

(Cut-elimination Algorithm) 

Let i a cut occurring in a resolution-like (tree) proof of the form 

: Qi : Q2 

p, r p,t 
r . 
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We define two cut-elimination algorithms, called L-reduction and R-reduction 
as follows. 

When the L-reduction is applied to t, we eliminate the cut t to obtain a new 
proof of T,r as follows. Suppose that s is a non-initial sequent in Q\ of the 
form Ii-\,p, II2. We have a new sequent s' (= LTi, II2, T) replace s if the indicated 
occurrence of p in s is an ancestor of the left cut-formula of i. The end-sequent 
of Qi turns into r,T after the transformation. We say that s produces s' and 
that i changes s. For initial sequents of the form, p,p, we delete these sequents 
and paste the subproof Q2 instead if the indicated occurrences oip are ancestors 
of the left cut-formula of i. Suppose that t is an occurrence of a sequent in Q^. 
More than one copies oft are possibly produced by the reduction. We say that 
t produces the copies of t in the new proof. 

When the R-reduction is applied to t, we obtain a proof of T, T symmetri- 
cally to the L-reduction. Suppose that s is a non-initial sequent in Q2 of the 
form Ai,p, A2. We have a new sequent s' (= Ai, A2, T) replace s if the indicated 
occurrence of p is an ancestor of the right cut-formula of i. We say that s pro- 
duces s' and that t changes s. For the initial sequents of the form, p,p, we delete 
these sequents and paste the subproof Q\ instead if the indicated occurrences 
of p are ancestors of the right cut-formula of i. We say that an occurrence of a 
sequent t in Q\ produces the copies oft in the new proof. 

Let P be a resolution-like (tree) proof of T, and L\, . . . , t„ be the list of cuts 
occurring in P. A string of R and L of length n is called a strategy (on P). We 
denote the set of strategies (on P) by A. Note that card(A)= 2". 

For each strategy a £ A, we denote the \ th (1 < i < n) element of a by a 8 -. 

Now we define algorithms to eliminate all the cuts. Given a strategy a, we 
eliminate the cuts in P bottom-up following a: for each i (1 < 1 : < n), we apply 
a 8 -reduction to eliminate all the cuts produced from t 8 -. 

Let a be a strategy and s an occurrence of a sequent. We define the influence 
of a on s by the number of cuts which change s. Formally we define it as follows. 
Definition 3 Let a be a strategy on P and s an occurrence of sequent in the 
end-part of P. We define the influence of a on s by the number of literals / in 
s satisfying the either one of the following; 

1. There exists 1 < i < n such that a 8 - = L, and / is an ancestor of the left 
cut-formula of t 8 . 

2. There exists 1 < i < n such that a 8 = R, and / is an ancestor ot the right 
cut-formula of t 8 . 

We define the influence of a on P by the maximum influence of a on all the 
occurrences of sequents in P. Finally we define the influence (of A) on P by 
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the minimum of influences of all the strategies in A on P. When a <E A has the 
minimum influence on P, we call a a wining strategy. 



4 Main theorem 

In the previous section, we define an algorithm to eliminate atomic cuts from 
GCNF (tree) proofs. Given a strategy, the DAG lengths of the result proof 
may grow exponentially to that of the original proof. In this section, however, 
we show that there must exist a strategy which keeps the DAG-length quasi- 
polynomial to that of the original proof. Our proof is sketched as follows. 
Suppose that the influence on a given GCNF (tree) proof, P, is k. We first 
show that the DAG length of the result proof is bounded by n°( k K Then, 
we show a combinatorial theorem which gurantees that k must be bounded by 
log(n). 

Theorem 2 Let P be a resolution-like (tree) proof of T, and L\, . . . , t„ be the 
list of cuts occurring in P. Suppose that a is a wining strategy on P. Then, 
dlen((P; a)) < dlen(P) 1+2k . The size of each sequent in (P;a) is bounded by 
k ■ dsize(P) 

(Proof.) Let S be a sequent, and s an occurrence of S in P. Without loss of 
generality, we can assume that S is of the form 



Note that the final sequents in (P; a) produced from s are in the same form. 

Now we want to compute how many different sequents can be produced from 
occurrences of S in P. We can choose k' (< k) literals among l\, . . . , l m which 
satisfy either one of the condition given in definition 3. For such li, there may 
exist many (< dlen(P)) cuts of which cut-formula is li. Hence, we have at most 



different sequents produced from S. □ 

In principle, we try both R-reduction and L-reduction to every cut in P and 
adopt the strategy of which influence is minimum. However, there are cases such 
that it is apparent which reduction is suitable to adopt. Consider the following 
case; suppose that L\ is the lowermost cut of the form 





2k 



■■ Qi 



■■ Q2 



p,T 



p,T 



r 
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where the depth of Q\ is 100 and that of Q2 is 2 and k = 15. Define the subset 
A* of A by {a|«i = R}. Then, it is obvious that for any strategy a and sequents 
in Q2, the influence of a on s is less than k. The influence of A* on P cannot 
exceed k. We can find a wining strategy in .4*. 

We define S by the set of occurrences of sequents in P; 
S = {s|3a G .4(the influence of a on s is greater than or equal to k)}. 
S 7^ by the definition. 

Lemma 2 Suppose that Li is a cut in P of the form 

\Qi \ <h 

p,H p, £ 
s *'. 

Let s be an occurrence of a sequent in Q\ (resp. Q2), arid a a strategy. Define 
a* in A by 

a * _ f R ( res P- L ) */i = * 
J ' [ aj otherwise 

Then, the influence of a* on s is less than or equal to that of a on s. 

(Short-cut algorithm) 

Suppose that t is a lowermost cut in P such that it is of the form 

: Qi j Q2 
p,H p, £ 
s *' 

and that there is no occurrence of a sequent in S in Q\ (resp. (^2)- Then, we 
determine a 8 - = L (resp. a 8 - = R). 

Define the new A by {a|a 8 ' = L} (resp. {a|a 8 ' = R}). Then, the influence 
of new A on P is also k. New S (determined by new A) is obtained, and it 
may be different from the old one. (Note that new S is a subset of old S.) See 
if there is any cut which satisfies the condition above for new S. Continue the 
algorithm until there is no such cut left. 

Now we assume that we already applied the procedure given above to a given 
resolution-like proof; to some cuts, either L- or R-reduction are pre-assigned on 
some cuts and others remain unassigned. Those cuts to which neither reductions 
are pre-assigned are called free cuts. 

In the remainder of the argument, we fix P as a resolution-like (tree) proof, 
ii, . . . i n the list of free cuts in P, A the set of strategies over t, . . . , t„, k the 
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influence of A on P, and S as defined above. We define S by the set of the 
lowermost occurrences of sequents in S. 

Definition 4 Let a be a strategy on P. The path determined by a is a sequence 
of sequents si, . . . , Sh satisfying the following conditions. 

1. The end-sequent is s\. 

2. For all i, s 8 +i is an upper sequent of s 8 . 

(a) If Si is the lower sequent of a structural inference, s 8 +i is the upper 
sequent of s 8 . 

(b) If Si is the lower sequent of a non-free cut, and if L-reduction is 
assigned to the cut, then s 8 +i is the right upper sequent of s 8 . 

(c) If Si is the lower sequent of a non-free cut, and if R-reduction is 
assigned to the cut, then s 8 +i is the left upper sequent of s 8 . 

(d) If Si is the lower sequent of a free cut, iy, and ctj = R, then s 8 +i is 
the right upper sequent of s 8 . 

(e) If Si is the lower sequent of a free cut, iy, and ay = L, then s 8 +i is 
the left upper sequent of s 8 . 

3. In the end-part of P, Sh is one of the uppermost sequents. 

Lemma 3 Let a be a strategy on P. Then, there exists an occurrence of a 
sequent in S on the path determined by a. 

(Proof.) Obvious from the reduction defined by the short-cut algorithm. 

Converse is also true; for any s £ S, there exists a strategy such that s is on 
the path determined by the strategy, and the influence of a on s is equal to k. 

Lemma 4 Let s £ S and a £ A. Suppose that s is on the path determined by 
a. Then, the influence of a on s is equal to k. 

(Proof.) It suffices to show that for any [3 £ A, the influence of [3 on s is less than 
or equal to that of a on s, but it is obvious from the definition of influence. □ 
Definition 5 Let s be an occurrence of a sequent and p a variable. We say 
s is on the left (resp. right) tree of p if there exists an occurrence of a cut t 
such that i is the uppermost cut below s of which cut-formula is p and s is a 
left-ancestor (right-ancestor) of the lower sequent oft. 

Lemma 5 Let S be a sequent, s occurrences of S in P. Suppose that s is in 
the left (resp. right) tree of some variable p. Then, p cannot occur negatively 
(resp. positively) in S. 
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(Proof.) Without loss of generality, suppose that s is in the left tree of p. Sup- 
pose that p occurs negatively in S. By the definition of resolution-like proofs 
and lemma 1, the clause p in s must be eliminated by the uppermost cut of 
which cut-formula is p below s. It contradicts with the hypothesis that s is in 
the left tree of p. □ 

Lemma 5 tells us that if s is an occurrence of S, and if s is in the left (resp. 
right) tree of p, either the following conditions hold; 

1. p occurs neither positively nor negatively in S, 

2. for any occurrence oft of S, t must also occur in the left (resp. right) tree 
of p. 

Lemma 6 Let S be a sequent, s,t occurrences of S, and p a variable. Suppose 
that s occurs in the left tree of p and t in the right tree of p. Then, neither s 
nor t is influenced by any cut of which cut-formula is p. 

(Proof.) Straightforward from lemma 5 and the definition of influence. □ 

Next we define a new notion, an effect, which is a similar concept to an 
influence. 

Definition 6 Let t £ S of the form T and t a free cut below t. We say that t 
effects t if and only if T does not have occurrences in P such that one is in the 
left tree of p and another in the right tree of p. 

We define the effect on t by the number of cuts which effect t, and the effect 
on P by the minimum of effects on all the occurrences of sequents in S. 

Lemma 7 Let s £ S, and a £ A. Suppose that s is on the path determined by 
a. Then, the effect on s is greater than or equal to the influence of a on s. 

(Proof.) By lemma 6, t effects s, if it influences s. □ 

Lemma 7 guarantees that the effect on P is greater than or equal to the 
influence on P, which is k. 

Now we show the following. 

Theorem 3 When the influence on P is k, k < log(dlen(P)). 

We prove the theorem by showing that the number of different sequents in S 
must be greater than or equal to 2 k . 
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Before we start the proof, we extract "critical informations" from P and 
construct a backbone of P. For example, suppose that P is of the form 



Pi, Pi 



P'i,P'i 



Pi, Pi 



P'i,P'i 



Pi, Pi 



P'i,P'i 



Pi, Pi 



P'i,P'i 



Pi,Pi,P'i P'i,Pi,P'i 

PlP'2,Pi,P2 
Pl,P'2,PlP'2 



Pl,Pl,P2 P2,Pl,P2 
PlP2,Pi,P2 
Pl,P2,PlP2 



Pl,Pl,P2 P2,Pl,P2 
PlP2,Pi,P2 
Pl,P2,PlP2 



Pl,Pl,P2 P2,Pl,P2 
PlP2,Pi,P2 
Pl,P2,PlP2 



(sl =) Pl,P2,PlP2,PlP2 (s2 =) Pl,P2,PlP2,PlP2 («3 =) P\ , p 2 , PlP2 , PlP2 («4 =) pi , p 2 , PlP2 , PlP2 



*2 



P2,PlP2,PlP2 



P2,PlP2,PlP2,PlP2,PlP2 



(t =) p 2 ,PlP2,PlP2 
P2,PlP2, Pi P2,PlP2, Pi P2 



(U =) P\P2,PlP2,PlP2,PlP2 

where l\,l 2 ,L3 are the list of free cuts, k = 2, and S = {sl, s2, s3, s4}. Then, 
we extract the information consists of follows to get the backbone of P; 

1. sequents in S 

2. free cuts and their lower sequents, where the lower sequents of cuts are 
labeled by the cut-formulas. 

In this particular example, the backbone of P is illustrated as Figure 1 



*3 







pi 


pi 







Figure 1: 



Notice that there are 4 (= 2 2 ) different sequents in S while the influence on 
P is equal to 2, which happens to be equal to the effect on P. We need technical 
definitions to count the number of different sequents in S. 

Definition 7 A tree evolved from P is a binary labeled tree T satisfying the 
following conditions. 

1. The number of edges leading out from a node is either 2 or 0. 

2. Every leaf is labeled by a sequent in S . 
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3. Every inner nodes are labeled by a variable. 

The terminology for proofs, such as ancestors and D AG-length, are also used 
on trees evolved from P, and they are defined in a similar manner. However, 
we need to be careful about the definition of left and right tree. 
Definition 8 Let Q be a tree evolved from P. A leaf s of Q is in the left tree 
(resp. right tree) of a variable p if there exists a node t such that t is the upper- 
most node labeled by p below s and s is a left-ancestor (resp. right-ancestor) oft. 

We define effects on the occurrences of sequents exactly as we did for resolution- 
like proofs. The backbone of P is obviously a tree evolved from P. We construct 
a sequence of trees evolved from P, To,T, . . ., so that there exists / > o such 
that leaves of 7] are all different and the effect on T/is equal to that on To. 

Define To by the backbone of P. Let k* be the effect on T). We define m* 
by the number of different variables labeled on inner nodes in T) . 

For i > 0, we construct T+i from T as follows. Suppose that the root of T 
is labeled by p. Define L\ by the set of sequents labeled to the leaves in the left 
tree of p, and by those in the right tree of p. Note that any leaf labeled by 
a sequent in L\ fl L2 is not effected by p. 
(Case 1) 

Suppose that card(Li) > card(L2). Let / be a matching between L1/L2 and 
L'z/Li. (A matching between sets D and E is a set of mutually disjoint un- 
ordered pairs {i,j}, where i £ D and j £ E.) For every node t labeled by p, we 
remove t and all the branches leading out from its left upper node. As a result, 
we obtain a tree, T* , which is also evolved from P, but the effect may decrease 
by 1. We construct a branch, called B s , as Figure 2. 



s 



P 



Figure 2: 

Now we construct a new tree T+i from T* as follows. For every leaf in T* , 
which are labeled with f(s), we graft the branch B s onto the left-hand side of 
the edge adjacent to the leaf (Figure 3). 
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Figure 3: 



T 8 '_|_i is a tree evolved from P as is TJ- Note that p effects leaves labeled by s 
and f(s) since it is labeled to the uppermost node below them. f(s); the leaves 
labeled by f(s) are all effected by p in Ti + \. The effect on TJ+i is equal to fc* 
assuming that the effect on TJ- is also k* . No repeating occurrence of p in any 
path in TJ+i . 
(Case 2) 

Suppose that card(Li) < card(Li2). We define an algorithm to transform TJ- in a 
similar manner to the algorithm given in (case 1), excepting that we interchange 
left and right throughout. As a result, we obtain a tree TJ+i evolved from P , 
on which effect is k* assuming that the effect on TJ- is also k* . 

Lemma 8 Let p be a variable. There is no repeating occurrence of p in any 
path in T m t . 

Lemma 9 There exists I > m* such that no two leaves in T; have the same 
label. 

(Proof.) Let s £ S_ and i > 0. Define overlap of s in denoted by overlap(s, TJ-), 
the number of leaves labeled by s in TJ- . The overlap of TJ- is defined by the max- 
imum of overlaps of all the sequents in iS. We also define £/je distance o s in 
Ti, denoted by d(s,Ti), by the smallest number of edges connecting two nodes 
labeled by s. The distance of d(Ti), is defined by the minimum of distances 
of all the sequents hit the maximum overlap. For any i > m* , d(Ti) < 2m* and 
overlap(Ti) > 1. We show for each i > 0, one of the following condition holds. 

1. Labels of leaves in TJ- are all different, 

2. overlap(Ti + i) < overlap(Ti), or 
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3. d(T i+l ) > d(Ti). 

Suppose that neither 1 nor 2 holds. Then, there exists s such that s is hitting the 
maximum overlap in TJ+i, and overlap(s,Ti) > overlap(s,Ti + i). Let p denote 
the label of the root of TJ-. Since overlap(s,Ti) > overlap(s,Ti + i), s is occur- 
ring either in the left or right tree but not both. Suppose that s is relocated 
when we transform TJ- into TJ+i; s is labeled on leaves in B s , and grafted onto 
the edge adjacent to f(s). Then, overlap(s,Ti) = overlap(f(s),Ti + i). At the 
same time, d(f(s), > d(f(s), T 8 ). Suppose that s is not relocated. Then, it 

is apparent that d(s, > d(s, T 8 ). In either case, we have d(Ti + i) > d(Ti). □ 

Now we resume the proof of theorem 3. 
(Proof of theorem.) For any leaf s in 7], the number of edges connecting the 
root of Ti and s must be at least k* since the effect on T; is k* . That means T; 
has more than 2 k leaves. By lemma 4, there are at least 2 k different sequents 
in S. Since k* > k, we have k < log(dlen(P)). □ 

As a result of theorem 1, 2 and 3, we obtain the following result; cut-free 
Gentzen system (LK) quasi-polynomially simulates resolution. 

Theorem 4 For any resolution refutation P of a set of clauses C\ , . . . , C'k of 
length n and size m, there exists a cut-free LK proof P* of C\, . . . , C'k —> of 
length n 1+2lo s n and size m 2+3iogn^ 
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